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Abstract — For every Gaussian network, there exists a corre- 
sponding deterministic network called the discrete superposition 
network |1|. We show that this discrete superposition network 
provides a near-optimal digital interface for operating a class 
consisting of many Gaussian networks in the sense that any code 
for the discrete superposition network can be naturally lifted to 
a corresponding code for the Gaussian network, while achieving 
a rate that is no more than a constant number of bits lesser than 
the rate it achieves for the discrete superposition network. This 
constant depends only on the number of nodes in the network 
and not on the channel gains or SNR. Moreover the capacities 
of the two networks are within a constant of each other, again 
independent of channel gains and SNR. We show that the class 
of Gaussian networks for which this interface property holds 
includes relay networks with a single source-destination pair, 
interference networks, multicast networks, and the counterparts 
of these networks with multiple transmit and receive antennas. 

The code for the Gaussian relay network can be obtained 
from any code for the discrete superposition network simply by 
pruning it. This lifting scheme establishes that the superposition 
model can indeed potentially serve as a strong surrogate for 
designing codes for Gaussian relay networks. 

We present similar results for the KxK Gaussian interference 
network, MIMO Gaussian interference networks, MIMO Gaus- 
sian relay networks, and multicast networks, with the constant 
gap depending additionally on the number of antennas in case 
of MIMO networks. 

Index Terms — approximate capacity, deterministic model, dis- 
crete superposition model, interference networks, relay networks 



I. Introduction 

Computing the capacities of wireless networks is a 
formidable problem. It has been a central theme of network in- 
formation theory, where research has attempted to characterize 
the information-theoretic limits of data transmission in various 
abstract models of networks. Of potentially equal importance, 
and also of great interest, has been the problem of construction 
of optimal coding schemes which achieve the promised rates. 
However, except for some examples of networks like the MAC 
channel 133], ©| and the broadcast channel JHl, ISH, it has 
so far not been possible to accurately compute the capacities 
of even simple networks like the relay channel lil4J . [|15I 
involving a small number of nodes. 
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This has motivated the pioneering line of work in H-Q 
which aims at developing a class of noiseless deterministic net- 
works that can serve as surrogates for Gaussian networks and 
suggest near-optimal coding schemes for Gaussian networks. 
Here the stringent goals of network information theory are 
relaxed by allowing for answers which are within a bounded 
gap of the capacity. By bounded gap is meant a constant that 
is independent of the channel gains or SNR, and is a function 
of only the number of nodes in the network. This relaxation 
in the problem has yielded new insights which have inspired 
construction of near-optimal coding schemes and improved our 
understanding of the fundamental limits imposed by various 
network configurations 13], Hi], H, ||30l, lITOl . 

In this paper, we focus on rigorously studying the use of 
deterministic models both for approximating the capacity of 
relay and interference networks, as well as for the construction 
of coding schemes. We prove that a particular deterministic 
model, called the discrete superposition model, can serve as a 
digital interface for operating these Gaussian networks in that 
any coding scheme for this discrete superposition model can 
be naturally lifted to the Gaussian networks with no more than 
a bounded loss in the rate for each user 

A. Relay networks 

The relay channel was introduced by Van der Muelen in 
lfT4ll . and certain achievable rates were determined. Cover and 
El Gamal ifTSJI subsequently studied the network in detail 
and developed two coding schemes which are now known 
as decode-and-forward and compress-and-forward. They also 
provided an upper bound on the capacity of the relay channel; 
which is now referred to as the cut-set bound. The cut-set 
bound was developed in full generality for networks in |16|, 

m. 

The decode-and-forward scheme was extended to networks 
with many relays in ifTTl . lITSI . and compress-and-forward was 
extended to larger networks in 1191 . All these papers intro- 
duced certain novelties in existing coding schemes, though the 
best upper bound on the capacity of relay networks continues 
to be the cut-set bound. These schemes do not generally 
provide a guarantee of near optimality, in the sense that the gap 
between the achievable rates for any of these coding schemes 
and the cut-set bound could be arbitrarily high as a function 
of SNR. So, in effect, it has not been clear whether it is the 
cut-set bound that is weak or whether it is the coding schemes 
that are not exploiting all the features available in the wireless 
network. 



This impasse motivated the work of {4'\, which studied 
coding schemes with provable guarantees on performance. 
This was done through the approach of introducing a linear 
deterministic model, where the hnearity is over a finite field. 
The linear deterministic model captures the broadcast and in- 
terference aspects of wireless networks. The capacity of linear 
deterministic relay networks was determined to be precisely 
the same as the cut-set bound, which was additionally shown 
to be achievable by random coding at the nodes. Further, 
coding schemes were developed for general deterministic 
relay networks in (5], where the received signal at a node 
is a function of the signals transmitted by the neighboring 
nodes. The coding schemes for the Unear deterministic model 
motivated the construction of coding schemes for Gaussian 
relay networks in |3|, |6|, |7|, which are provably near-optimal 
for relay networks in the sense that they achieve all rates 
within a bounded gap from the cut-set bound. This result 
also establishes as a corollary that the cut-set bound for the 
Gaussian relay network is indeed approximately the capacity 
of the Gaussian relay network. Recently, the gap of between 
the achievable rate for the relay network and the cut-set bound 
has been further reduced by a generalization of compress-and- 
forward |l9l|. 

However, the above-mentioned results do not establish the 
closeness of the capacities of the linear deterministic network 
and the Gaussian network. Also, the near-optimal coding 
schemes for the Gaussian network are certainly inspired by 
the coding scheme for the linear deterministic counterpart, but 
there is no rigorous procedure to design a coding scheme for 
the Gaussian network given a coding scheme for the linear 
deterministic network, and consequently also no procedure 
for lifting codes from one to the other while preserving 
near-optimality. Hence it is unclear if a deterministic model 
only serves to aid the intuition in the construction of coding 
schemes for Gaussian networks, or if there is a more funda- 
mental connection between the capacities of the deterministic 
network and the Gaussian network. More importantly, it is 
not clear if a coding strategy for a deterministic model can be 
explicitly used for designing a coding strategy for the Gaussian 
network that achieves comparable performance. We answer 
these questions in the affirmative here. 

The linear deterministic model however does not approx- 
imate the capacity of Gaussian networks in general |3l, Q, 
lH]. The question that therefore arises is whether there is a 
procedure for constructing for every Gaussian relay network 
a corresponding deterministic network in such a way that 
the capacities of the Gaussian and the deterministic network 
are within a bounded gap. This has been done in [|3|, Q 
and im via different models. In the truncated deterministic 
model in O, l?) the channel gains and inputs are complex 
valued, while the complex channel output has integer real 
and imaginary parts. The model used in |1|, the discrete 
superposition model, is a somewhat more discrete model in 
the sense that channel gains and inputs are discrete valued. 
Hence corresponding to any Gaussian network, one can indeed 
construct a discrete superposition network which has the same 
topology as the Gaussian network and captures the effects of 
broadcast, interference, and attenuation by channel gains in 



a wireless network (see Section |II-C| l. As mentioned in |T|, 
|T1, the bounded gap in capacities of the deterministic and 
Gaussian model does not necessarily imply a correspondence 
between coding schemes for the two models. 

The next question that arises in this program of rigorous ap- 
proximation of Gaussian networks by deterministic networks 
is whether one can also rigorously recover, with performance 
not decreasing by more than a constant number of bits, near- 
optimal coding schemes for Gaussian relay networks from 
coding schemes for discrete superposition relay networks. We 
establish this result in the affirmative. In fact, we show a 
stronger property that every coding scheme for the discrete 
superposition network can be simply mapped to a similar 
scheme for the Gaussian relay network in such a way that it 
continues to provide comparable performance. The lifting pro- 
cedure therefore works uniformly over the class of all coding 
schemes, producing uniformly close performance. The lifting 
procedure is particularly natural and consists essentially of just 
pruning the codewords and using jointly typical decoding. 

Thus the discrete superposition model provides a near- 
optimal digital interface for operating a Gaussian relay net- 
work. We extend this correspondence to MIMO relay networks 
and multicast networks. The superposition model may poten- 
tially be easier to design codes for than the Gaussian model 
since noise has been eliminated from the network and the sets 
of inputs and outputs are finite. Potentially, perhaps, wireless 
network coding could be useful in studying superposition 
networks, and subsequently Gaussian networks. This remains 
an open and intriguing question. 

B. Interference networks 

Interference networks have received much attention recently, 
following the results in ll20l . In 1201 . the capacity of the 
interference channel with two transmitters and two receivers 
is determined within a constant gap of 1 bit. The near-optimal 
coding scheme in |20| is a specific choice among the myriad 
strategies proposed in ll2Tl . A simpler proof of the result in 
11201 is provided in fS?!. This was independently strengthened 
in ll26l . Il27l . and |28| where treating the interference as noise 
is shown to be capacity achieving for a restricted range of the 
parameters corresponding to a low interference regime. The 
capacity region of the 2x2 deterministic interference channel 
was determined in 1221 . In l8|, it is shown that the capacities of 
the linear deterministic interference channel and the Gaussian 
interference channel are within a bounded gap. A variant of the 
discrete superposition model was first used in JS) in a sequence 
of networks that reduced the 2x2 Gaussian interference 
channel to a linear deterministic interference channel. 

Much less is known for Gaussian interference networks 
with more than two users. The number of degrees of freedom 
of the time-varying interference channel was characterized 
in ESJI using the idea of interference alignment, and they 
were characterized for specific interference channels with fixed 
gains in |30|. Generalized degrees-of- freedom region of the 
fully symmetric many-user interference channel was computed 
in Ii29j . In general, the capacity region of the interference 
networks with three or more users is unknown, even to within 
a constant gap. 



In this paper we prove that the rate region of the Gaussian 
interference network and the corresponding discrete superpo- 
sition interference network are within a bounded number of 
bits. In fact, this correspondence extends to the case where the 
nodes have multiple transmit and receive antennas. 

We can also prove the stronger result that the discrete 
superposition model provides a digital interface for operating 
the Gaussian interference network, in the sense, as above, that 
any coding scheme for the discrete superposition interference 
network can be lifted naturally to a scheme for the Gaussian 
interference network, with no more than a bounded loss in the 
rate for each user This correspondence is extended to MIMO 
Gaussian interference networks where nodes can have multiple 
transmit and receive antennas. 

C. Outline of the paper 

In Section [n] we introduce the various models used in this 
paper, viz. the Gaussian model, the linear deterministic model, 
and the discrete superposition model. 

We develop the digital interface property for operating 
Gaussian relay networks using the discrete superposition 
model (Section IIIi. The capacities of the Gaussian and 



discrete superposition relay networks are within a bounded 
gap. We rigorously prove a procedure for lifting any code for 
the discrete superposition relay network to the Gaussian relay 
network, and prove that there is no more than a bounded loss 
in the rate (Section [III-B| |. 



We show in Section 



IV that the rate regions of the Gaussian 



interference network and the discrete superposition interfer- 
ence networks are within a bounded gap. This proof subsumes 
the procedure for lifting a code from the discrete superposition 
interference network to the Gaussian interference network. 

We also address the near-optimality of the digital interface 
defined by the discrete superposition model for MIMO 
channels, MIMO relay networks, MIMO interference 
networks, and multicast networks (Section [V| ). 

Notation : We will denote all random variables by lower 
case letters. Random vectors will be denoted by underbars, 
for example, as x = [xi,X2, ■ ■ ■ ,Xm, . . . ,xn)- If x is a 
complex vector, then the m-th term in the vector is given by 
Xm = x„iji + iXmi, where XmR and Xmi are the respective 
real and imaginary parts of Xm- 

We denote the quantization of a complex number x by [x], 
where 



fiign{xB)[\xR\\ +isign(a::/)[|x/|J. 



(1) 



In the above, [-J is the standard floor function. 

Abusing notation, we will use the same letter to refer to 
the random variable and its realization. All logarithms in the 
paper are to the base 2. 

II. Models for networks 

A. Gaussian networks 

We begin by describing the class of Gaussian networks 
of interest. We consider a wireless network represented as a 
directed graph (V, f ), where V represents the set of nodes, and 



the directed edges in £ correspond to wireless links. In the case 
of relay networks, we label the nodes in V as {0, 1, ... , Af }, 
where is the source, M is the destination, and the remaining 
are relay nodes. In the case of KxK interference networks, we 
divide the nodes in V into two sets; the first set {1,2,..., K} 
consisting of the transmitters, and the second set {1, 2, ... , K} 
consisting of the receivers. Though we use the same numbers 
to denote the sources and the destinations, it will be clear from 
the context which node we are referring to. 

Denote by htj the complex channel gain for link (i, j) G £. 
Let the complex number Xi denote the transmission of node 
i. Every node has an average power constraint, taken to be 1. 
Node i receives 






fl'ijXi -j- Zj 



(2) 



where N{j) = {i : (i,j) G £} is the set of its neighbors, 
Zj is CA/'(0, 1) complex white Gaussian noise independent of 
transmitted signals, and /lij = hiju + ihiji. 

For MIMO networks, the nodes are allowed to have multiple 
transmit and receive antennas. The transmitted and received 
signals are described by vectors, where the number of elements 
in the vector corresponds to the number of transmit/receive 
antennas. 



B. The linear deterministic model 

The linear deterministic model was introduced in f4l as 
an approximate model to capture certain aspects of wireless 
networks. It should be noted that the linearity is with respect 
to the finite field F2. 

The linear deterministic model is constructed based on the 
given Gaussian network as follows. We begin by choosing all 
the inputs and outputs of channels to be binary vectors of 
length max(i j)££ [log |/ijjpj. Each link with channel gain h 
in the Gaussian network is replaced by a matrix that shifts 
the input vector, allowing [log|/ipJ most significant bits of 
the input to pass through. At a receiver, shifted binary vectors 
from multiple inputs are added bit by bit over the binary field. 
This models the partially destructive nature of interference in 
wireless. Modeling the broadcast feature of wireless networks, 
a node transmits the same vector on all outgoing links, albeit 
with different attenuations. 

As an example, consider the Gaussian network in Fig. |l(a)| 
and the corresponding linear deterministic network in 
Fig. 



1(b) 



All inputs and outputs of channels are vectors in 
F2 of length 3. The channel is simply a linear transformation 
over the binary field. In the sequel, we will consider the high 
SNR scaling of a Gaussian network as all the channel gains 
{hij : {i,i) e £} are scaled by a large positive constant 7 to 
give ^hij : {i,j) £ £. We will study the relationship between 
the capacities of the high SNR scaled network and the linear 
deterministic approximation. 

1) Inability of the linear deterministic model to capture 
phase and power: The linear deterministic model cannot 
capture the received signal power in certain Gaussian networks 
13], m. Also, it does not capture the phase of the channel gain 
in a Gaussian network IT], as shown below via an example. 




(a) Example of a Gaussian network. 




(b) Linear deterministic network. 
'3^ 




(c) Discrete superposition network. 

Fig. 1. A Gaussian network, linear deterministic counterpart, and discrete 
superposition network. 



Due to these facts, in general, the model does not approximate 
the capacity of a Gaussian network to within a bounded 
number of bits. 




(a) A Gaussian network. 






Llog |h|2j 
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(b) Portion of deterministic network. 

Fig. 2. Example showing that the linear deterministic model does not 
approximate the capacity of the Gaussian network to within a bounded number 
of bits. 



Example: It is known that the cut-set bound is a near- 
optimal measure of the capacity of both the Gaussian and 
the linear deterministic network |[3l, IJS). Now consider the 
Gaussian network in Fig. [2] where the channels marked as 
CO have very high capacity. For this network, the cut-set 
bound is determined by the mutual information across the cut 



17 = {0, 1, 2}, which is given b>Q 

I{xa■,yn^xn^) = \og\I + HH^\ 

= 41og|/i|+0(l), as \h\ ^00, 

with 

n = 



^h h 
h h 



Hence, the capacity of the network is approximately 4 log \h\. 

A portion of the linear deterministic model for the Gaussian 
network is shown in Fig. |2(b)| The capacity of the linear 
deterministic network is the rank of the transfer matrix of 
n = {0,1,2} which is [2 log |/i|J. 

The gap between the capacities of the Gaussian network in 
Fig. |2(a)| and its linear deterministic counterpart is therefore 
2 log \h\ + 0(1). It is unbounded as \h\ — > 00. 

The main reason for the unboundedness of the gap is that 
the linear deterministic model only considers the magnitude 
of a channel gain and effectively replaces each channel gain 
by a power of 2 that is nearest to its magnitude. Hence it 
does not capture the phase of the channel gain in Fig. 2(a) 



Constructing the deterministic model over a larger prime field 
(than F2) does not circumvent this problem. 

C. The discrete superposition model 

The above example motivates the search for a deterministic 
model that does approximate the capacity of a Gaussian 
network to within a bounded number of bits. We now describe 
such a model. We associate a noiseless deterministic model 
with a Gaussian network, as follows. Let 



max max{ [log |/iiji?| 



[log\h.,,i\\}. (3) 



The inputs to the deterministic model are complex valued, 
with both real and imaginary parts taking values from the 
2" equally spaced discrete points {0, 2~",...,1 — 2~"}, 
scaled by l/\/2. The real or imaginary part of an input 
can accordingly be represented with the binary representation 
a; = -^ X^Li 2~''a;(fc), with each x{i) e ¥2. 

Next we quantize the real and imaginary parts of channel 
gains in the Gaussian network to integers by neglecting their 
fractional parts, as in ([T]i. The resulting quantized channel gain 
for link (i,j) is given by [hij]. 

The operation of the channel in the discrete superposition 
model is as follows. The channel between two nodes i and 
j in the discrete superposition network multiplies the input 
by the corresponding channel gain [hij] and quantizes the 
product by neglecting the fractional components of both real 
and imaginary parts, i.e., it forms [[/i^-Jxi]. Note that [[/lijjxi] 
lies in Z + iZ. These values are then added up at a receiver 
by the standard summation over Z + iZ. The received signal 
at node j is therefore given by 



Vi 






'^ijl'^i 



(4) 



'We chose i.i.d. J\f (0,1) inputs since this choice maximizes the mutual 
information, up to a constant |6). 



Note that this model retains the essential superposition 
property of the wireless channel. Quantization of channel 
coefficients does not substantially change the channel matrix 
in the high SNR regime. Also, the blurring effect of noise is 
captured by constraining the inputs to positive fractions that 
can be represented by finite bits, as well as by quantization of 
the channel output. 

Finally we note that the inputs in the discrete superposition 
model are restricted to be peak power constrained by the same 
value as the average power constraint on the transmit signals 
in the original Gaussian network. An important property of 
the discrete superposition model is therefore that the transmit 
signals in it are thus also valid for transmission in the Gaussian 
network. That is, encoder outputs in the discrete superposition 
network can also be used in the Gaussian network. We will see 
later in the sequel that this is important when we show how 
coding strategies in the discrete superposition network can be 
mapped to coding strategies in the Gaussian network. 

For example, consider the Gaussian relay network in 



Fig. 1(a) and its discrete superposition counterpart in Fig. 1(c) 



The discrete superposition model preserves multiplication by 
channel gains and the superposition property of the channel, 
with the Gaussian noise replaced by quantization of outputs. 
The discrete superposition model was first used in |8| in a 
sequence of networks that reduced the Gaussian interference 
channel to a linear deterministic interference channel. Here, 
it was shown that the capacities of the two-user Gaussian 
interference channel and the discrete superposition model of 
the same network are within a bounded gap. The model was 
generalized and given the name of a superposition model in 
([T] to distinguish it from the linear (over a vector space) 
deterministic model. 

III. A DIGITAL INTERFACE FOR GAUSSIAN RELAY 
NETWORKS 

In this section, we study the class of Gaussian relay net- 
works with a single source-destination pair. Here, the aim is 
to prove that the discrete superposition model for the relay 
network provides a near-optimal digital interface for operating 
the network. An example of a Gaussian relay network, with 



source 0, and destination 6 is shown in Fig. 1(a) In general, we 
consider networks with il/ + 1 nodes, with the nodes labeled 
as 0, 1, ... , M, where node is the source, node M is the 
destination, and the remaining nodes are relay nodes. The relay 
nodes have no information of their own to transmit, and assist 
the source in transmitting its data to the destination. 

The description of a coding scheme for a relay network 
is somewhat involved due to the functions applied on the 
incoming data by the relay nodes. Also, due to the presence 
of cycles in the network, the encoding functions of the nodes 
can vary with time. In general, we define a {2^^,N) code 
for a relay network to be an encoding function for the source 

xo-. {1,2,...,2^«}^A'^, 

where X is the input alphabet of the channel, and a set of 
encoding functions for relay node k. 



where 3^^ is the alphabet of node fc's received signal. For sake 
of simplicity, we assume that the input alphabet of each relay 
node is the same. As mentioned before, the encoding function 
of a relay node can vary with time, and the transmitted symbols 
can depend on all the received symbols previously received by 
the relay. The destination AI's decoding function is given by 



9M ■■ y^i 



{l,2,...,2^^n, 



where J^m is the alphabet of the received signal of the A/-th 
node. Let A^ be a random variable uniformly distributed on 
{1, 2, . . . , 2^^} that corresponds to the message that source 
wants to communicate. Then A4 is mapped to the codeword 
a;Q(7VJ). The average probability of error is given by 



P. 



Pr{9M{yJ ^ M), 



where y is the signal received by the destination node M. 
The capacity of the relay channel is the supremum of all rates 
R such that for any e > 0, there exists a blocklength N for 
which Pe < e. 

It is easy to show that any achievable rate below the capacity 
can be described by a multi-letter mutual information term: 

Lemma 3.1: A rate R lies in the capacity region of the 
relay network if and only if there exists a blocklength N, a 
distribution p{xq), and a set of distributions {p{xi.\y ),k = 
1, 2, . . . , M — 1} conforming to the causal constraints on the 
encoding functions at the relays as mentioned above such that 



R < 



1 

N 



^feo;2^«r) 



(5) 



where y is the received signal at the destination. 



Proof: We note a subtlety in computing the mutual in- 
formation I{xQ;y) where the effect of the various encoding 
functions applied at the relay nodes is captured in the joint 
distribution of Xq and y . 

Suppose we are given a distribution p{xq) on input vectors 
of length N and a set of relay encoding functions {pixk\y,.)}- 
Construct a collection of 2™^^ codewords for the source 
where each codeword is constructed by independently picking 
m vectors, each of length N, with the distribution p{xq) 
and appending them to get codewords of length niN. The 
encoding functions at the intermediate relay nodes are given by 
the transition probability kernels {p(a;j,|y )}. The destination 
decodes by looking for a codeword jointly typical with its 
reception. Using standard random coding arguments (see |35)), 
the probability of error can be made arbitrarily small by 
considering large enough m. This proves the achievability of 
the rate. 

For proving the converse, fix a (2^^, N) code and observe: 



NR 



= H{M) 
- I{M;i 
< I{M;y 



HM;yJ 



M' 



1 + P^NR 



< I{x^-,yJ + l + PeNR, 



(6) 
(7) 



9k,'. 



yV"-^ -^ X, for m 



1,2, 



,N, A: = 1,2 M-1, 



where we used Fano's lemma in (|6]) and the data processing 
inequality in (|7]i. Since the rate R is achievable, P,, can be 



made arbitrarily small for sufficiently large N. On dividing 
both sides of (|7| by N, we get the converse. ■ 

The subtlety in the above proof regarding the mappings at 
the encoders makes the proof of the near-optimality of using 
the discrete superposition model as a digital interface for relay 
networks more involved than in the case of the interference 
channel, which is analyzed subsequently in a later section. 

The next theorem indicates the relevance of the discrete 
superposition model in analyzing relay networks. 

Theorem 3.2: Consider the relay network described above. 
The capacity Cq of the Gaussian relay network and the 
capacity Co of the discrete superposition relay network is 
within a bounded gap of k-ji bits where 



KK 



0{M log M). 



(8) 



Note that the above theorem does not establish any cor- 
respondence between coding schemes for the two networks. 
We prove the converse part of the theorem next, i.e., Co > 
Cg ~ 0{M log M), while the remainder of the proof is 
presented in the subsequent section via a stronger argument 
concerning the lifting of coding schemes from the discrete 
superposition model to the Gaussian model. 

A. Capacity( Discrete superposition network) > Capac- 
ity(Gaussian network), to within an additive constant 

Lemma 3.3: Let Cq be the capacity of the Gaussian relay 
network, and Co be the capacity of the discrete superposition 
relay network. Then, 



Cd > Cg - 0{M log M). 



(9) 



Proof: The cut-set bound lfT6l on the capacity in a 
single source-destination pair relay network, with source and 
destination M, is given by 

C < max min /(xq; yncjxnOi (10) 

p{xo,xi,...,xm-i) f26A 

where C is capacity of the network and A is the set of all 
partitions of V with e ft and M £ il''. In fl] it is proved that 
the cut-set bound for a Gaussian network is achievable up to a 
bounded gap, where the gap is at most 0{M log M). It is also 
proved that while computing the cut-set bound for a Gaussian 
relay network, we can choose the random variables in the 
optimization to be i.i.d. complex Gaussian CAf{0, 1). With this 
choice, the cut-set bound evaluates to within 0{M log AI) of 
the maximum value. 

It is also proved in [7| that the cut-set bound is achievable 
for a general class of deterministic networks, of which the 
discrete superposition network is a special case, when the 
maximization in the cut-set bound is restricted to independent 
random variables. 

We consider a particular cut in a network. We start with 
the Gaussian model and choose the random variables corre- 
sponding to the cut to be i.i.d. Gaussian. We reduce the Gaus- 
sian network to a discrete superposition network in stages, 
bounding the loss in mutual information across the cut at each 



stage. At the end, we will have a distribution for the inputs 
for the discrete superposition network where all the inputs are 
independent. We will prove that the total loss in the mutual 
information as a result of these transformations is at most 
0(A/logA/). Repeating this across all the cuts in the network 
proves that the capacity of the discrete superposition network 
is within 0(Af logAf) bits of the capacity of the Gaussian 
network. 

Now consider a cut fl containing P nodes (including the 
source) and il^ containing Q nodes (including the destination), 
where P + Q — AI + 1. For the sake of simplicity, we assume 
that all the nodes in fl are connected to all the nodes in fl'^, 
and the nodes in 51"^ are not connected among themselves^ 
The received signals in il"^ are given by 

p-i 
Vj = X! ^*J^« + 2j , j = P, . . . , M. (11) 

Choose Xi as i.i.d. CAf{0, 1). Each transmitted signal Xi 
can be split into its quantized part [xi] and fractional part Xi, 
where 



We discard the quantized part [xi] of all the transmitted signals 
and and retain Xi. Since x/s satisfies a unit average power 
constraint, Xi's also satisfies a unit average power constraint. 
Define 

p-i 

Vi •= X! ^^i S:i + Zj, j = P,...,M. (12) 

4=0 

The discarded portion of the received signals is given by 

p-i 
Vj - y"%N, i^P,...,M. (13) 



The mutual information across the cut il for channels ( [TT] i and 
([12]) can be compared as 

I{xn;yn.) < I{xn;yn-,m-) 

< I{xn,[xn];yn-,yn-) 

= I{xn,[xn];yn'') + I{S:n,[xn];yn'=\yn') 
= I{xiy,yn-) + I{xn,[xa];ya-\m'=) (14) 

< I{xn;yn-) + H{yna) 



p-i 



< /(5;n;ynO + I]^(N), 



(15) 



where ( [T4| l follows because [xq] — > x^^ — > i/n^ form a Markov 
chain, and ( fTS) holds because yj is a function of {[x^]} from 
( [T3| ). It is proved in the appendix that H{[xi]) < 6; hence 
we get 

-^(5n;yn0 > I{xn;yn'=) -6P- (16) 

Since xnj and xu lie in (—1, 1), we obtain positive inputs 
by adding 1 to each. This is equivalent to adding J^i hij{l + i) 

-This ensures that /(xn; yn<:\x(ic) = I{xq; yw)- The more general case 
can also be similarly handled. 



to ijj. We also divide by 2\/2 throughout to get: 



p-i 



fe + E'^^^(l + ''))/2^ 



4 = 
P-1 



= ^h,j{ii + l + i)/2V2 + Zj /2V2. (17) 

i=0 

Lets denote the vector of (1 + i)'s as i'q and note that 

r ( ~ xn + vn ~ 

J ( ~ ^o + t^o . - m- + (Ei hij{i + i))j 



2V2 '"" ' 2^2 

jf xn + vn , m- + (Ei fc»j(l + 0)j ' 
V 2\/2 ' 2V2 

To avoid introducing new notation, for the rest of the proof 
we abuse notation and denote the left hand side of ( [T7] l by yj, 
{xi + (1 + ?))/2y2 by Xj, and Zj/2%/2 by Zj, for all j. 

With the new notation, \xi\ < 1, with positive real and 
imaginary parts, and Zj is distributed as CJV{0, 1/8). 
The features of the model that we next address are: 

1 ) channel gains are quantized to lie in Z + tZ, 

2) real and imaginary parts of the scaled inputs are re- 
stricted to 

n := max max{[log|/iijfl|J, [log|/iy/|J} (18) 

bits, 

3) there is no AWGN, and 

4) outputs are quantized to lie in Z + «Z. 

Let the binary expansion of \/2xiii, be Q.XiR{l)xiR{2) . . ., 
i.e.. 



V ^ „=1 



(19) 



The received signal in the discrete superposition channel only 
retains the following relevant portion of the input signals: 

p-i 

y'j = 5][N].t;], j = o,...,p-i, (20) 



i=0 



where 



^iR 



^il 



1 " 

-^5:x.«(p)2-^ 

^ p—i 

n 

Y.^,,(j>)2-^, 

^ p—1 



with n defined in ([3]). To obtain pO] ) we subtracted 6j from 
yj, where 

p-i 
Sj ■■= E [ht]{x^ - Xi) + (/ly - [hij]) x.^ + (21) 

i=0 



{[h,j]x,- [[h^.j]x,])j +Zj 
p-i 
=: J2 ^»J + ^J 



i=0 

Vi + z. 



To bound the loss in the mutual information across the cut in 
the discrete superposition network from the original Gaussian 
interference network, we have 



li^fi;yn'=) 
< I{xn 
= I{xn 
= I{xn 
= I{xn 



ya<=,yao,(5oc) 

Vn-) + I{xn]5ii.Ayn-=) 

y'a") + HSn-ly'n") - HSn-ly'a- , xn) 

y'n-) + lT'{Sn-) -h{5n'=\yn'=,xn,m) 



< Hx'n;y'^.) + J2iHSj)-hizj)) 
j=P 

AI 

j=p 

By bounding the magnitudes of the terms in ( |2T] i, we get 
\wij\ < 3\/2. So, I{vj]5j) is at most the mutual informa- 
tion of a Gaussian MISO channel with average input power 
constraint less than (3V2)^P < 18P and 



I(vfSj) 



< 



< 



log(l + 18P/(l/8)) 
log(l + 144P). 



(22) 



Hence we get 



Hxn^yn^) > Iixn;yn^)-Q\ogil + lUP). (23) 

Note that I{x,;y .) is the mutual information across the cut 
il in the discrete superposition network. By accumulating the 
losses in transforming the inputs for the Gaussian channel into 
the corresponding inputs for the deterministic channel in ( fTS] ) 
and (|23ll, we obtain 



Hxn;yn^) > I{xn;yn^)-0{M\ogM), 



(24) 



where xq and ync in the above equation are the respective 
channel inputs and outputs in the original Gaussian cut. Also, 
we began by choosing the inputs x, in the Gaussian network to 
be i.i.d. Gaussian CJ\f{0, 1). At the end of the transformations, 
the channel inputs in the discrete superposition network x^ are 
also independent. This completes the proof. ■ 

B. Lifting a coding scheme for the discrete superposition 
network to the Gaussian network 

A coding strategy for either the Gaussian or superposition 
relay network specifies codewords transmitted by the source, 
a mapping from the received signal to a transmit signal for 
every relay node, and a decoding function for the destination. 
For sake of simplicity of exposition, we assume that the graph 
describing the network is acyclic and that the relays employ 



time-invariant encoding functions. Later, in Section. III-D6 



we mention how to handle more general encoding functions. 

We describe how to lift a coding strategy for the discrete 
superposition network to a strategy for the Gaussian network. 

Consider a (2^^, TV) code for the discrete superposition 
network with zero probability of error, for a certain N . 
The probability of error can be reduced to zero due to the 
deterministic nature of the network; see Sec. |III-D1| 



Denote the block of N transmissions at node j in the 
discrete superposition network by an A^-dimensional transmit 
vector a; , and similarly the received vector by y . All signals 
in the discrete superposition network are a (deterministic) 
function of the codeword Xq transmitted by the source. 

Next, we build a {2'"'-^^,mN) code, denoted by Cq, for 
the discrete superposition network, with every TTiiV-length 
codeword constructed by adjoining m codewords from the old 
code, for a large m. This is again a rate R code since it simply 
uses the old code m times on the superposition network. We 
can visualize the construction of codewords in Cq by referring 
to Fig. [3] 

In the (2™^^, mN) code, node j 

1) breaks up its received signal, denoted by y., into m 
blocks of length N, 

2) applies the mapping used in the (2^^, N) code on each 
of the m blocks to generate m blocks of transmit signals, 

3) and adjoins m blocks of transmit signals to construct a 
new transmit signal, denoted by x,, of length mN . 



1 M M M M M 1 1 M 1 M 1 M 1 M 1 


1 A' syinboLs i | | | 


1 III 


III 


m vectors in the (2'^^,iV) code 


r 


1 1 






Transmit signal in tire (2'"~('^-''-k), W) code 
Fig. 3. Relationship among the signals transmitted by node j. 

As shown in Fig. l3] the relationship between various signals 
associated with the transmission of node j is akin to packeti- 
zation in computer networks. 

A subset of the codewords of Co, defined below, forms the 
set of codewords of the code for the Gaussian relay network. 

Pruning the set of codewords: Node j has a finite set of e- 
strongly typical y.'s (see fSSl) in the code for the superposi- 
tion network. We randomly, i.e., independently and uniformly, 
pick a 2^™(^'^+2'?) fraction of them and denote the resulting 
set by 5j. K > is defined later in ([30| as a function only 
of the number of nodes in the network and not the channel 
gains, while r/ > is specified later and can be made arbitrarily 
small. We repeat this pruning procedure for all the nodes. 

Denote the intersection of the inverse images of Sj in Co, 
for j = 1, 2, • • • , M, by Cg- Transmission of any vector in Cg 
results in the received vector at node j belonging to Sj in the 
discrete superposition network. Cq forms the set of codewords 
for the Gaussian network. 

Encoding and decoding procedure in the Gaussian network: 
The source in the Gaussian network transmits a codeword 
Xq from Cg- Assume throughout that node 1 can listen, i.e., 
has a link, to the source. Node 1 receives a noisy signal and 
decodes to a vector in Si. We will specify in the sequel how 
this decoding is to be done. Then, using the encoding function 
from Co, it constructs its transmit signal. All relay nodes 
operate in a similar way. Finally, the destination decodes its 
noisy reception to a signal in Sm, and maps it to a codeword 
by simply using the decoding function from Cq. 



Note that we are operating the Gaussian network over the 
digital interface naturally defined by the signals transmitted 
and received in the corresponding discrete superposition net- 
work. 

We summarize the main result concerning the hfting pro- 
cedure in Theorem 13.41 



Theorem 3.4: Consider a Gaussian network with a single 
source-destination pair and M — 1 relay nodes, and consider 
a code for the discrete superposition model of the network that 
communicates at a rate R. 

Then, the lifting procedure and the digital interface defined 
by the discrete superposition model yield a code for the orig- 
inal Gaussian network that communicates at a rate R — k^, 
where 

Kk := Af(log(6Af-l) + 10). (25) 

It should be noted that k.^^ does not depend on the channel 
gains. Therefore, the above theorem provides a lifting pro- 
cedure that attains a rate in the Gaussian network within a 
bounded amount of R at any SNR. 

The theorem also proves that the capacity of the Gaussian 
network is at least as large as the capacity of the discrete su- 
perposition network, up to a bounded gap k^^. This completes 
the proof of Theorem 3.2 Also, the above theorem applies 
to any coding scheme for the superposition network and, in 
particular, to an optimal scheme. Since the capacities of the 
Gaussian and the superposition network are within a bounded 
gap, the optimal scheme for the superposition network can be 
lifted to obtain a near-optimal coding scheme for the Gaussian 
network. 

C. A genie-based argument 

Before delving into the details of the proof, we start with 
a genie-based argument explaining the ideas behind the proof 
of The orem |3.4| The theorem is subsequently proved in detail 
in Sec. IIII-DI 

The arguments presented next are not a proof of the theo- 
rem, but are the basis of our understanding of the relationship 
between the Gaussian and the discrete superposition model, 
and motivate the search for a technique to lift a code from the 
superposition network to the Gaussian network. 

Consider 



the networks in Fig. 1(a) and Fig. 1(c) For 
simplicity, assume node 1 transmits a symbol xi and node 

2 transmits a symbol X2 (instead of a block of symbols each) 
from the alphabet for the discrete superposition network. Node 

3 receives 

Vs = [[/ii3]a;i] + [[/i23]a;2] 



in the discrete superposition network in Fig. 1(c) and it 
receives 

ya = hisxi + h2zX2 + z^ 



in the Gaussian network in Fig. l(a)| Rewriting j/3, we get 
2/3 = y3 + (^i3a;i - [/ii3]a;i) + ([/ii3]a;i - [[^i3]a;i]) 

+ (^23a;2 - [h2'i]x2) + {[h25]x2 " [[^23]a;2]) + Z^ 

-■ Va + vs + zs. (26) 



Here we have replaced the actual values of the channel gains 
with appropriate variables. By definition y^ lies in Z + zZ. 
Hence j/g can be recovered from y^, the quantized values [wg] 
and [zs] respectively of v^ and 23, and the quantized carry C3 
obtained from adding the fractional parts of V3 and Z3, with 



C3 



:= [(«3-N) + (^3-N)]. 



(27) 



[v^l and [2:3] are defined as quantized functions of V3 and 23, 
as in ([TJ- 
So, 

2/3 = [ys] - [ws] - [2:3] - C3, 



and 



^(2/312/3) < H{[V3])+H{[Z3]) + H{C3), (28) 



Now, let 



where 



^^3 



Wi3 + W23, 



Wk3 ■= ihk3Xk-[hk3]xk) + i[hk3]xk~[[hk3]xk]), k = l,2. 

Since \hk3 — [hk3]\ < a/2 and Ixfel < 1, the magnitude of 
V3 is less than 2(2-\/2). [v3ii] and [W3/] He in {—5, —4, . . . , 5}, 
and iJ([w3]) < log(22). The real and imaginary parts of the 
carry lie in {0, ±1}, hence H{c3) < 3. Since 23 is distributed 
as CA/'(0, 1), from Lemma [7T| in the appendix, _ff([z3]) < 6. 
Adding up all the entropies and substituting in ( |28] l, we get 
the upper bound 

H{y3\y3) < 14. 

These computations can be repeated for all the nodes in 
Gaussian network. In general, if there are A/ incoming signals 
at a relay node j, then the magnitude of Vj is less M(2-\/2), 
where Vj is similarly defined, as in (|26]l, with respect to the 
signal received by node j. Hence, Vjji and Vjj will lie in the 
set {-3M + 1, -3M + 2, . . . , 3M - 1} then 

H{y'j\yj) < H{[v,]) + H{[z,]) + H{c,) (29) 
< log(6M - 1) + 10, 

where Cj is defined, as in pT] ), with respect to the signal 
received by node j. Let 



log(6Af - 1) + 10 



(30) 



be a function of the total number of nodes and independent of 
channel gains (or SNR). Now we use a code designed for the 
superposition network in the Gaussian network. If there were 
a genie providing H{yAyj) bits of data corresponding to the 
received signal to node j in every channel use, then node j 
can recover y. from y.. Since the genie provides at most k 

bits to every node, it provides a total of at most Mk = k-j^ 
bits per channel use. 

Hence, with the genie's aid, a code designed for the discrete 
superposition network can be used in the Gaussian network at 
any SNR. Our proof below prunes a fraction of the codewords 
representing the information that the genie would have pro- 
vided, so that the decoding can work even without the genie. 



D. Proof of Theorem \3.4\ 

1) Zero probability of error: Consider the (2^^, A^) code 
for the superposition network and assume that it has an average 
probability of error S, where < S < 1/2. Since the 
superposition network is a noiseless network, each codeword is 
either always decoded correctly or always decoded incorrectly. 
Since S < 1/2, less than half of the codewords are always 
decoded incorrectly. Discarding them results in a code where 
all codewords can be successfully decoded, with a small loss 
in the rate. So, without loss of generality, we assume that the 
(2^-^,iV) code (and thus also the (2™^^,mA^) code) for the 
superposition network has zero probability of error. 

Xq, the random variable corresponding to the codeword, has 
a uniform distribution with H{xq) — NR, and induces a 
distribution on the remaining variables in the network. 

2) Operating over blocks of length mN: In the 
^2mNR^ ^jY) code, we assume that every node buffers raN of 
its received symbols, eventually constructing a transmit signal 
of length mN, and transmits it over the next mN channel 
uses. 

For the network in Fig. |l(a)| this is possible since nodes 
can be grouped into levels such that only nodes at one level 
communicate with another level. For example, nodes 1 and 2 



in Fig. 1(a) can buffer their reception till node completes its 



transmission, then construct their transmit signals, and transmit 
to nodes 3 and 4 over the next m,N channel uses. 

For a general network, we need to differentiate between 
signals received by a node at various time instants to account 
for causality in construction of their transmit signals. This 



requires slightly modifying the procedure; see Sec. III-D6 

3} Pruning the code with respect to node 1: Each y. (or 
Xj) in Co is generated by n independent samples from the 
distribution of y (or x ). Choose e > 0. For a sufficiently 

large m, node 1 has a collection of at most 2™' \y_^)+'^-i) ^jj^j 
at least 2™*^ Wi)-<^2) g.gjjongly typical received vectors in the 
discrete superposition network corresponding to Cq (see ll35l ). 
where £2 > 0. As e — >■ 0, e2 — )• 0. With rj set to €2, we 
construct Si by randomly selecting a 2^'"(^''+^'') fraction 
of this collection. We do this by choosing a subset uniformly 
among all the subsets of the appropriate size. \Si \ can be upper 
bounded as follows (see (p9l)-(|30l) 



\S I < 2™'-"^*--!''^^^-' 2^™'^'^+^'') 

Similarly, we can show that |5i| > 2™(^(^'i)^^''"3"^\ 

For a large n, the number of codewords in Cq jointly e- 
strongly typical with a particular y can be bounded inde- 
pendently of the chosen y ; see ll35l . The desired set has 

2™( K^o\v-J <^2) codewords for a particular y , i.e., transmis- 
sion of one of those codewords in the superposition network 
results in node 1 receiving the chosen y . Due to the deter- 
ministic nature of the channel, the sets of codewords in Co 
jointly typical with two different vectors in Si form disjoint 
sets. To construct Co,i, we pick the set of all codewords in Co 
that are jointly e-strongly typical with some vector in Si. We 
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have. 



|Co,: 



E 



(# of codewords in Cq jointly 
e-strongly typical with y ) 



^ 2rn{H{y^)~Nii~'e2) ^ 2'"(-f^(5ol2£i)+'S2) /^l) 

_ 2™-(^(^o-yi')-^'^) (32) 

_ 2"'(-f^(5o)-^'') (33) 

where ( (33] l follows since ^^(j/ Ixg) = 0. Similarly, we can 
show that |Co,i| > 2'"(^(3io)-^'«-4e2)_ 



Codewords 




2niH{x„\y,^,y^) codewords jointly 
e - typical with each (y^,y2) 



Set of y 's 



Set of y^'s 



Fig. 5. Pictorial representation of further pruning with respect to node 2. 



2mH(x„\y^) codewords 
jointly € — typical 
with each y 




respectively, with high probability for a large n, since \Sj 



< 



Codewords Setofy^'s 

Fig. 4. Pictorial representation of pruning with respect to node 1. 

If the source transmits a codeword from Co,i in the Gaussian 
network, then the signal y received by node 1 can be regarded 
as a noisy version of the signal y it would have received 
in the superposition network, as shown in ( p6| ). Therefore, 
we define a channel with input y and output y . Node 1 
decodes by finding a vector in Si that is jointly weakly e- 
typical with the received vector in the Gaussian networlr] 
Since |5i| < 2™^ (l^i'i^i)"'^^)^ decoding is successful with 
block error probability less than (, where C -> as n — !• oo. 

4) Further pruning the set of codewords with respect to 
node 2: There are 2™' ^y.2^y-i> '^J vectors in the set of y 's 
at node 2 that are jointly e-strongly typical with a particular 
y € Si. Since we constructed ^2 by randomly choosing a 
subset containing a 2~"'(^''+2'=2) fraction of the set of all y 's, 

for a large n, there are 2™^ Wjl^^i^^ '" '^j vectors in ^2 
jointly e-strongly typical with each y e 5i. Hence, there 

are 2™^ yy_i'V_2'~ '^ '^^> jointly e-strongly typical vectors in 
Si X ^2 with high probability (whp) as n — > cx). 

Now, 2™(^(-«l^i'^2)±<^2) codewords in Co are jointly e- 
strongly typical with each e-strongly typical tuple in Si x S2- 
We iterate the procedure in the previous subsection by collect- 
ing the codewords in Cq which are jointly e-strongly typical 
with the e-strongly typical tuples in S1XS2, and denote this set 
by Co, 1,2- Naturally, Co, 1.2 is a subset of Co,i. As in ((3T|-((33]l, 
we obtain |Co,i,2| is about 2™(^(3;o)-2W'=±7<^2) ^hp. 

If the source transmits a codeword from Co, 1,2, then nodes 
1 and 2 can correctly decode to vectors in Si and ^2 

^^Since y is a continuous signal, we use weak typicality to define the 
decoding operation. Note that strongly typical sequences are also weakly 
typical; hence sequences in cSi are weakly typical. 



2m(Iiyr,y.) 



^^ for J e {1,2}. 
5) Further pruning with respect to the remaining nodes: 



The same procedure is repeated with respect to the remaining 
nodes in the network. In the end, we obtain a collection of 
at most 2'''iHixo)-MNK+eM) and at least 2™(^(£o)-A^^'=-^J'f) 
codewords whp, denoted by Co,i,... .m ='■ Cq, where e^/ > 0. 
Note that e^/ ^- as e ^- 0. Transmission of a codeword in 
Cg results in the received signal at node j in the superposition 
network belonging to the set Sj. 

Now, if Cg is used on the Gaussian network with encoding 
and decoding procedures at all nodes as described above, then 
the destination can decode to the transmitted codeword whp. 
Thus, on the Gaussian network, Cg achieves the rate 



H{xo)/N-MK-eM/M 



R~Mk- eu/M 
i?-«K-eM/M, (34) 



where e^v/ can be made arbitrarily small. 

6) Interleaving the codewords for general networks: 
mentioned in Sec. IIII-D2 



As 



we need to slightly modify the 
lifting procedure for relay networks which have irregular level 
sets that do not permit straightforward buffering of received 
symbols at a node. 

In this case, codewords in Cq are constructed by adjoining 
N blocks of m symbols each, where the first block 3;o(l) 
consists only of the first symbols of m codewords of the 
{2^^,N) code, the second block a;Q(2) consists only of the 
second symbols of the same codewords, and so on. The source 
transmits XQ(t)'s in the order of increasing t. 

In the (2^-^, N) code, let y^(i), t = 1, . . . , TV, denote the 
t-th symbol received by node j. We adjoin the t-th received 
symbols from m uses of the code to construct y.{t). Since 
Xj{t), the i-th symbol transmitted by node j, is a function 
of {j/,(p)}i^\, node j can construct xAt), vector consisting 
of the t-th transmit symbols from m uses of the code, after 
receiving {y^.(p)}p^\, . 

Essentially, we interleave the symbols from m uses of the 
same code to ensure that the nodes can buffer their receptions. 

In order to lift the coding scheme to the Gaussian network, 
we prune Cq by randomly picking a 2^™''*+'^''' -fraction of the 



11 



set of e-strongly typical y (t), for all t, for all j, and collecting 
the codewords jointly e-strongly typical with them to form Ca- 
in the Gaussian network, each node buffers its reception 
for m time units, decodes to the appropriate y (i), constructs 
Xj{t + 1), transmits it on the next m time units. The des- 
tination decodes individual m-length blocks to get y it), 
t — 1, 2, . . . , A^, and decodes to a codeword in Cq after de- 
interleaving {y^/0}- 



This completes the proof of Theorem 3.4 



IV. A DIGITAL INTERFACE FOR GAUSSIAN INTERFERENCE 
CHANNELS 

In this section we show that the capacity regions of the 
Gaussian interference channel and the discrete superposition 
interference channel are within a bounded number of bits, 
independent of channel gains or SNR. This result was proved 
for the case of 2 x 2 interference channel (and a slightly 
different deterministic model) in fSl, and we use some of 
the techniques here. Also, similar to the case of the relay 
network, we develop a systematic way to 'lift' any code for 
the discrete superposition interference network to the Gaussian 
interference network, and establish that it does so with no more 
than a bounded loss in the rate. 




Fig. 6. K X K interference network. 

Consider the KxK Gaussian interference network shown in 
Fig. l6] There are K sources, labeled 1,2, ... ,K, who want to 
transmit data to K destinations, labeled 1,2, ... ,K. All the 
sources are independent of each other, and the k-th source 
wants to transmit information only to the fc-th destination. 
At the fc-th destination, the transmissions by other sources 
interfere with the transmission of the fc-th source. Let Xi be 
the signal transmitted by the i-th source, for i — 1,2, . . . ,K. 
The received signal at the j-th destination is 



Vj 



K 

E 
fe=i 



hkjXk + Zj, j = 1,2,.. 



,K, 



(35) 



where Zj is CAf(0, 1). 

Consider the corresponding discrete superposition model 
for this interference channel. The received signal at the j-th 
destination in the discrete superposition model is 



Vj 



K 

E 

fc=i 



1,2, 



[[hkj]xk\, i 
where [hkj] is the quantized version of h^j. 



,K, 



(36) 



Consider a block code for the interference network, either 
the Gaussian version or the discrete superposition version. 
Such a (2^-"i , 2^-^2 _ ^ ^ 2^^^' , N) code for the interference 
channel is defined by an encoding function for each source 



{1,2, ...,2^''^^} -^A"^, forfc = l,2 



,K, 



where X is the input alphabet of the channel, and a decoding 
function for each destination 



gu-.y^ ^{1,2, 



■>NRu 



}, fox k^ 1,2,..., K, 



where 3^^ is the output alphabet of the channel at destination 
node fc. Let M.j be a random variable uniformly distributed 
on {1, 2, ... , 2^^^ }, for each j, corresponding to the message 
that source j wants to communicate to destination j. Mj is 
mapped to the codeword xAAij). The average probability of 
error is given by 

Pe = Pr{gk{yJ ^ Mk, for some fc), 

where y is the signal received by destination fc. The capacity 
region is the collection of all rate tuples R such that for any 
e > 0, there exists a blocklength N for which Pg < e. 

All rate vectors in the capacity region are referred to as 
achievable rate vectors. The next lemma states the equivalence 
between achievable rate vectors and a collection of multi-letter 
mutual information terms. 

Lemma 4.1: A rate vector R lies in the capacity region of 
the KxK interference channel if and only if there exists 
a blocklength N, and a collection of independent random 
variables {xk, k — 1,2, ... , K} such that 

Rk < ^Iix,;yJ,fork = l,2,...,K, (37) 



N 



where y is the received signal at the k-th receiver 

Proof: The proof of this lemma is similar to the proof 



of Lemma 3.1 and we need to consider the rate achieved by 
every source-destination pair individually. We skip the details. 

■ 

The main result of this section is: 

Theorem 4.2: Consider the KxK interference channel 
described above. The capacity region of the Gaussian interfer- 
ence channel and the discrete superposition interference chan- 
nel are within a bounded gap, where the gap is independent 
of channel gains or SNR. 

If Rq is a rate vector in the capacity region of the 
Gaussian interference channel, then there is a rate vector Rj^ 
in the capacity region of the discrete superposition interference 
channel such that 



\Rk,G - Rk.D\ < Ki, for k^ 1,2, 



,K, 



where 



Kl 



&K + \og{lUK + l). 



(38) 



(39) 



Conversely, a coding scheme for the discrete superposition 
interference channel corresponding to a rate vector Rjj can 
be lifted to the Gaussian interference channel to obtain an 
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achievable rate vector Rq with a loss of at most Kj bits, 
where 

Kj- := log(6ii: - 1) + 10. (40) 

We prove the theorem in a series of steps. First, using 
Lemma |4.1| we convert any achievable rate tuple for either 
the Gaussian or discrete superposition network into a set of 
mutual information terms. Then, by analyzing these mutual 
information terms, we prove that any coding scheme for the 
Gaussian channel can be transformed into a coding scheme 
for the discrete superposition channel, with at most a bounded 
loss in the rate. Similarly, we prove that any coding scheme for 
the discrete superposition interference channel (DSIC) can be 
lifted to the Gaussian interference channel (GIC), again with 
at most a bounded loss in the rate. 

A. Capacity(DSIC) C Capacity(GIC), to within an additive 
constant 

To begin, we prove that the capacity region of the discrete 
superposition interference network is at least as large as that of 
the Gaussian interference network, minus a constant number 
of bits. 

Lemma 4.3: Let Rq be a vector in the capacity region 
of the Gaussian interference channel. Then, there exists a 
rate vector R^ in the capacity of the discrete superposition 
interference channel such that 



Rok > Rak- Hi, for k = 1,2,..., K, 



(41) 



with Kx defined in \2>9). 



Proof: From Lemma 4.1 we know that we can replace 
any achievable rate vector Rq by a collection of mutual 
information terms {/(a;^,; y,), k — 1,2, ... , K}. We perform 
a series of approximations to convert the mutual information 
I{3ij\y ) corresponding to the rate of the j-th source to a 
mutual information term for the discrete superposition interfer- 
ence channel, incurring a loss no more than a bounded amount. 



Rest of the arguments are similar to the proof in Section III-A 



The received signal at the j-th receiver in the Gaussian 
interference channel is 



y^ 



K 

E 

fe=i 



h-kj -Lk + ^j 



(42) 



with the transmitted signals satisfying an average unit power 
constraint. 

Note that Xj. = {xki, Xk2, ■ ■ ■ , Xkn), and each Xkm can be 
split into its quantized part [xfe™] and fractional part Xkm, 
where 



•^kn 



•^kn 



[•^kj] 



We discard [xkm] and retain Xkm- Since x^m satisfies a unit 
average power constraint, Xkm also satisfies a unit average 
power constraint. Define 



K 



hh-i 






(43) 



Denote the discarded portion of the received signal by 

K 

Y^hu,[xk]. (44) 



y, 



k = l 



Comparing the mutual information corresponding to the j-th 
source-destination pair for channels ( |42] l and ( |43| ), we get 

NRq, = Iixy,y^) 

= I{xj , [xj] ; yp + I{x^ , [x^] ; y^ \y^ ) 

= Hxf,y^)+I{Xj,[Xj];y^\y^) (45) 

< I{x^;y.) + H{y^) 



K 



< /(%;t,) + E^(fefe])' 



(46) 



k = l 



where ( |45| ) follows because [xA ^ Xj -^ y . form a Markov 
chain, and ( |46l ) holds because ijjn is a function of {[xfe„]} 
from (|44]i. It is proved in the appendix that H{[xkm]) < 6, 
hence we get 



^(%;yp > i{xf.v^) 



QKN. 



(47) 



Since XkmR and Xkmi lie in (^Ijl). we obtain positive 
inputs by adding 1 to each. This is equivalent to adding 
Tlik ^fci(l + 1) to jjjm- Denoting by v_ the vector of (1 + i)'s, 
we also divide by 2\/2 throughout to get: 

fc 
= ^hkj{xk + v)/2y/2 + z^/2y/2. (48) 



Note that 



I{ij\h 



~ 1^-^' 2V2 '-^■' 2^2 

I 2^/2 ' 2^2 i ' 



To avoid introducing new notation, for the rest of the proof 
we abuse notation and denote the left hand side of ( |48| ) by y ., 

(ifc + v)l2^ by Xfe, and z^/2V2 by z^, for all j. 

With the new notation, \xkrn\ < 1. with positive real and 
imaginary parts, and Zj,„ is distributed as CN{Q, 1/8). 

The features of the model that we next address are: 

1) channel gains are quantized to lie in Z + iL, 

2) real and imaginary parts of the scaled inputs are re- 
stricted to 

n := max max{[log|/iij_R|J, [log|/iij/|J} 

bits, 

3) there is no AWGN, and 

4) outputs are quantized to lie in Z + iL. 
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Let the binary 

0.a;femfl(l)a:fc„ifl(2) . . 

^kraR 



expansion 
i.e.. 



of ^/2 



•^kmR 



be thereby proving the lemma. 



^ CX3 

^ p—1 



The received signal in the discrete superposition channel only 
retains the following relevant portion of the input signals: 



y. 



= Et^ 



kjl 



'±kh 



(49) 



where 



•^kmR 



^kml 



1 " 



V2 



XkraR{p)'2. 



p=l 
1 " 

^ p—i 



■^kml 



b)2-^ 



with n defined in (|3]l. To obtain ( |49l ) we subtracted (5 from 



y , where 

-3 



K 

E 

k=l 



hkj{xk - Xfe) + {hkj - [/('fcjD^/c + (50) 



([/lfej]Xfc- [[/lfcj]Xfe]) 



+ z, 



E^ 



-kj + ^i 



■ -J ' -V 

To bound the loss in the mutual information in the discrete 
superposition network from the original Gaussian interference 
network, we have 

= i{x-j\y.) + i{xj\5_j\y_j) 

= I{Xj \y.) + h{Sj \y^ ) - h{Sj \y^ , x^ ) 
< I{s.j\y^) + ^(5j) - h{5_j\{.,Xj,Vj) 

= iisij;y^) + HSj)-h{z^) 
= i{xj]y.) + i{vj-A^)- 

By bounding the magnitudes of the terms in (|50|l, we get 
|w/cjm| < 3\/2- So, I {v.: -,5. A is the mutual information of A^ 
uses of a Gaussian MISO channel with average input power 
constraint less than {3V^)^K < 18K and 



I{Vj;S^) < N\og{l + 18K/{l/8)) 
< iVlog(l + 144ii:). 



(51) 



Note that I{x^;y.) is the mutual information between the 
input and output of the j-th source-destination pair in the 
discrete superposition interference channel. By Lemma |4.1| 
this mutual information translates into an achievable rate 
Rdj = I{x^; y )/N . By accumulating the losses in transform- 
ing the inputs for the Gaussian channel into the corresponding 
inputs for the deterministic channel, we obtain 



Rdj > RG3-{6K + log{l + U4K)), 



(52) 



B. Capacity(GIC) C Capacity(DSIC), to within an additive 
constant 

Next we prove that the capacity region of the Gaussian inter- 
ference channel is at least as large as that of the discrete super- 
position interference channel, to within an additive constant. 
Once again, we prove this by converting a coding scheme 
for the discrete superposition interference channel to a mutual 
information expression using Lemma |4.1| and bounding the 
loss in the mutual information when transforming the discrete 
superposition model to the Gaussian model. The proof of the 
lemma below contains all the ideas on lifting a code from the 
discrete superposition model to the Gaussian model, and we 
explicitly mention them in a subsequent subsection. 

Lemma 4.4: Let Rjj be a vector in the capacity region of 
the discrete superposition interference channel. Then, there 
exists a rate vector Rq in the capacity of the Gaussian 
interference channel such that 



Rak > Rok — K-i, 



(53) 



for k = 1,2, . . . , K, where Kj^ is defined in ( [40] 



Proof: Pick any coding scheme for the discrete super- 
position interference channel which achieves the rate tuple 
Rd. From Lemma 4.1 we know that this corresponds to 
a collection of mutual information terms {I{x^.;y,), k = 
1,2,..., K}. Note that here x^. corresponds to the input of the 
fc-th source in the discrete superposition interference network, 
and y corresponds to the output of the k-th destination in the 
discrete superposition network. Now, we use the same input 
distribution on the Gaussian interference network and show 
that the mutual information corresponding to the rate of the j- 
th source does not decrease by more than a bounded amount. 
The same arguments will be applicable to the remaining 
mutual information terms. 

Since every element of the vector Xj, satisfies a peak power 
constraint, it also satisfies an average power constraint. Hence 
it can be used as an input to the Gaussian channel to get 



y^ 



Since we know that 



k 



h'kjSLk ~t" ?Lj 



= Y.^[hkA^ 



-feJ' 



(54) 



(55) 



we have 



K 

y, + J2^kj+^j 

k=l 



(56) 



where Wj. is defined as 



yikj — {hkj - [hkj\) x^ + {[hkj] Xf, - [[hkj\ Xf,])). 

By definition j/ is a vector of entries from Z + iL. Hence y . 
can be recovered from y , the quantized values of u and z , 
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and the vector of carries c obtained from adding the fractional 
parts of Vj and z . So, 

< I{x^;y^) + Hi[v^]) + Hi[z^]) + H{c^), (57) 

where [vA and [z ] are defined as quantized functions of w 
and Zj. Similar to the arguments preceding ( p9| ), we get 

Hi[v^]) < iVlog(12X-2), 

H(cj) < 3N, 
H{[z^]) < 6N. 

In ( |57] i, I{xj\y.) corresponds to A^ times the rate Rcj 
achieved on the Gaussian interference channel by the j-Xh 
source. Therefore we get 



R. 



Gj 



> Rdj - log(6is: - 1) - 10. 



Lemmas 4.3 and 4.4 together complete the proof of Theo- 
rem |42l 



C. Lifting codewords to the Gaussian interference network 

Since the construction of a digital interface and the pro- 
cedure of lifting codewords from the discrete superposition 
interference channel to the Gaussian interference channel was 
implicit in the proof of the above lemma, we summarize the 
procedure below: 

• Consider any coding scheme for the discrete superposi- 
tion interference channel that achieves a rate tuple R^ 

it can 



with probability of error e. Using Lemma 4.1 



be converted to a collection of mutual information terms 
{/(xj,; y, )}, where the fc-th term corresponds to the rate 
of the /c-th source-destination pair, and a collection of 
independent input distributions {p{x_fA)}, where the /c-th 
distribution corresponds to the input distribution for the 
fc-th source. The input distribution p{xf.) is over iV-length 
vectors, where A'^ is the length of the codewords for the 
discrete superposition interference channel. 

• We construct a mA^-length codeword for the fc-th 
source by picking m vectors independently with the 
distribution p{xf,), and adjoining them. We construct 
2mJV(i?.D3-Kx) codewords this way. This set forms the 
(2™^(^"3"''i),miV) code for the j-th source in the 
Gaussian interference network. Similar to the lifting 
procedure for the relay network, we can visualize the 
construction of codewords in the (2™^(^^3-''2;\mA^) 
code by referring to Fig. [3] 

• In the Gaussian interference channel, with joint typical 
decoding, the fc-th decoder can recover the codeword 
transmitted by the fc-th source with probability of error 
less than e, as we allow m to tend to oo. This is essentially 
proved in Lemma |44] 

Therefore, we can operate the Gaussian interference channel 
on the digital interface defined by the signals transmitted and 



received in the discrete superposition interference channel 
We can use techniques similar to the proof of Theorem 3.4 



to prove Lemma 4.4 but we chose to present the simpler 
proof involving manipulating mutual information terms di- 



rectly. From Lemmas 4.3 and 4.4 we know that the capacities 
of both the networks are within a bounded gap. Hence, if 
we choose a near capacity-achieving coding scheme for the 
discrete superposition interference network, we can transform 
it by following the above procedure for lifting codewords 
and obtain a near-optimal digital interface for operating the 
Gaussian interference network. 

V. Digital interfaces for some other networks 

In this section, we list some other networks for which the 
discrete superposition model provides a near-optimal digital 
interface, and where codes can be lifted from the discrete 
superposition counterpart to the Gaussian network. 

A. MIMO networks 

From the results in fl] and ||2i|, it is easy to see that MIMO 
channels are well-approximated by the discrete superposition 
model in the capacity sense, and we can lift codes from a 
MIMO discrete superposition channel to a Gaussian MIMO 
channel. This correspondence extends to more general MIMO 
networks too. 

1) MIMO relay networks: MIMO relay networks can be 
handled in the same way as in Section |lllj by treating 
each transmitted/received signal as a collection of vectors, 
where the size of the collection depends on the number of 
transmit/receive antennas. 

For simplicity, consider a relay network where every node 
has two transmit and two receive antennas. All transmitted 
and received signals are a pair of vectors in both the Gaus- 
sian and discrete superposition model for this network. For 
example, node j's received signal in the Gaussian model is 
y. = [y. ,,y. g]- The channel on a particular wireless 

link {i,j) is specified by four channel gains, {/i-- }, where 
fc e {1,2} indexes the transmit antennas of i and I £ {1,2} 
indexes the receive antennas of j. Assuming that the noises at 
both receive antennas are distributed as C7V(0, 1), we have 



^j,i 



i&^f{3) 



(Mf^ 



,2.1 



^i.i: 



; = 1,2. (58) 



We state the counterpart of Theorems 3.2 and 3.4 



Theorem 5.1: Consider a relay network where every node 
has a maximum of L transmit or receive antennas. The 
capacity Cq of the Gaussian relay network and the capacity 
Cd of the discrete superposition relay network is within a 
bounded gap of k-ji^l bits where 



>in.L - 0{LM\og{LM)). 



(59) 



Furthermore, a coding scheme for the discrete superposition 
MIMO relay network can be lifted to the Gaussian MIMO 
relay network with a loss of k^ ^ bits in the rate, where 



^V.,L 



LM(log(6LM-l) + 10). 



(60) 
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Proof: The arguments in Section [III] unchanged to this 
model by replacing every vector with its corresponding tuple 
of vectors. A simple way to derive the above results is to 
replace each set of L antennas at a transmitter or receiver 
by a collection of L virtual nodes. Now there are a total of 
LM virtual nodes in the network. The bound in ( |59l ) follows 
by replacing M with LM in Theorem |3.2| In order to lift 
the code from the discrete superposition MIMO network to 
the Gaussian relay network, we need to prune the source's 
codebook with respect to all the virtual nodes. Hence ( [SO] ) 
follows by replacing M in (|25]l with LM. ■ 



2) MIMO interference networks: The results in Theo- 
rem |4.2| can be extended io KxK interference channels where 
each transmitter and destination have multiple antennas. Once 
again, the constant determining the bounded gap is a function 
of the number of nodes in the network, as well as the number 
of transmit and receive antennas at the various nodes. 

Theorem 5.2: Consider the K x K MIMO interference 
channel described above, where every node has a maximum 
of L antennas. The capacity region of the Gaussian MIMO 
interference channel and the discrete superposition MIMO 
interference channel are within a bounded gap of kx^l bits, 
where 

Kx,L := 6LK + L\og{U4:LK + l). (61) 

Furthermore, a coding scheme for the discrete superposition 
MIMO interference channel can be lifted to the Gaussian 
MIMO interference channel with a loss of Kj^ j^ bits in the 
rate, where 



^I.L 



L{\og{6LK -1) + 10). 



(62) 



Proof: The proof of this theorem is similar to that of 
Theorem |4.2| We can treat each transmitting antenna and 
receiving antenna as a virtual node. For proving ( |6T] i, we 
compare with (|46| and note that in this case there are at most 
KL virtual transmitters that contribute 6LK to the bound. 
Since there are L virtual receivers at every node and each one 
contributes log(144L/ir+l) to the bound, where the arguments 
are similar to those preceding ( |5T] i, the total contribution of 
the virtual receivers is ilog(144i/f + 1). Adding up the two 
contributions, we get the bound in ( |6T| ). 

The bound in ( |62] l can be proved using the techniques in 
Lemma |4.4| Each of the L virtual receivers at a particular 
receiver contributes log{QLK — 1) + 10 to the bound, with a 
total of contribution of L{\og{6LK - 1) + 10) due to all the 
virtual receivers at a node. 



B. Multicast 

Consider a relay network with M + 1 nodes, with the nodes 
labeled as 0,1,..., Af, where node is the source and it 
wants to communicate the same information to a subset of the 
remaining nodes. The other nodes which are not the intended 
recipients act as relays. It is known that the cut-set bound 
is the capacity of such multicast networks, up to an additive 
constant LU, L2J. 



Theorem 5.3: Consider a multicast relay network men- 
tioned above. The capacity C'a of the Gaussian relay network 
and the capacity Cd of the discrete superposition relay 
network is within a bounded gap of km bits where 



KM = 0(M\og{M)). 



(63) 



Furthermore, a coding scheme for the discrete superposition 
multicast relay network can be lifted to the Gaussian multicast 
relay network with a loss of Km bits in the rate, where 



^M 



M(log(6A//-l) + 10). 



(64) 



Proof: From the results in [3| and [1], it is easy to 
prove the bound in ( |63] l. The basic idea is that the cut-set 
bound is approximately achievable for Gaussian and discrete 
superposition networks. Hence we need to prove that the 
cut-set bounds for the Gaussian network and the discrete 
superposition network are within a bounded gap. This is 



proved in Theorem 3.2 



For proving the results in (|64J, we choose a coding scheme 
for the multicast discrete superposition network. The key to 
lifting the coding scheme to the Gaussian network is that all 
the intended destinations in a multicast network are decoding 
the same data. Hence if we prune the source's codebook with 



respect to all the nodes, as in the proof of Theorem 3.4 then 
the pruned code can be decoded on the Gaussian network. We 
skip the details. 

We can extend the above theorem to the case when the 
nodes have multiple transmit and receive antennas. ■ 

VI. Concluding remarks 

One of the main problems in network information theory 
is computing the capacity region of a large network with 
many sources, many destinations, and arbitrary data transmis- 
sion requirements. Currently, this looks like no more than a 
distant possibility. As suggested by Q, a possibly simpler 
aim is to approximate a general network with a deterministic 
model, perhaps with the discrete superposition model. We 
have proved that the discrete superposition model serves as 
a near-optimal digital interface for designing codes for the 
Gaussian relay network and the Gaussian interference network. 
This transforms the problem of designing near-optimal codes 
for the Gaussian network to designing near-optimal codes for 
the discrete counterpart. Also, the problem of computing the 
capacity of the Gaussian network is reduced to a combinatorial 
problem of computing the capacity of a discrete network. 
In case of the relay network, even though we already know 
near-optimal coding schemes for the network, it may still be 
helpful to construct simple codes for its discrete superposition 
counterpart. Such simple schemes can be directly translated, 
via the lifting procedure proposed earlier, to construct simple 
coding schemes for the original Gaussian network. In the case 
of the Gaussian interference network, computing the capacity 
region of the 3x3 discrete superposition interference channel 
will yield the capacity region of the original Gaussian network 
to within a constant, and will improve our understanding 
of larger practical interference networks. We have not been 
able to prove the near-optimality of the discrete superposition 
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model in approximating the capacity of a general Gaussian 
network. 

A better understanding of the limits of approximating Gaus- 
sian networks with noiseless deterministic models will help us 
in computing the fundamental limits of wireless networks with 
many users and may also help in designing coding schemes 
for them. 

VII. Appendix 

A. Maximum entropy under a unit power constraint 

Lemma 7.1: Let x be a random variable whose domain is 
Z + i1, with i?[|a;p] < 1. Then the entropy of x is bounded 
as H{x) < 6. 

Proof: The entropy of x can be bounded as 

H{x) = H{xb. + ixi) 

< H{xr)+H{xi). 

Now 

H{xr) = H{&\gn{xR),\xR\^) 

< H{sign{xR))+H{\xR\^) 

< l + H{\xRf). 

Let z = \xr\'^. The domain of z is the set of non-negative 



integers. Since E[\x\'^] < 1, and E[z] = £;[|a 



xr\ 



< 1. 



If E[z] < 1, then p{z = 0) > 0. Since the alphabet of 
z is countably infinite, there is always a fc e Z+ with 
p{z ~ k) < p{z = 0), with the LHS possibly zero. Now 
mixing the probability distribution of z by replacing each 
of p{z = 0) and p{z — k) by the average of the two 
probabilities will increase the entropy (see Ii35i ). and will 
also increase the mean. Hence the entropy of z, subject to 
E[z] < 1, is maximized when E[z] = 1. For a given 
mean, the geometric distribution maximizes the entropy among 
all discrete distributions (see [35 1). Since the entropy of a 
geometric random variable over the non-negative integers with 
unit mean is 2, iJ(|x7^p) = H{z) < 2. 

Hence H{xr) < 3. Similarly, we can prove that iJ(a;/) < 
3. Combining the two bounds, we get the statement of the 
lemma. 
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